Rate equation approach for correlations in growing network models 
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We propose a rate equation approach to compute two vertex correlations in scale-free growing 
network models based in the preferential attachment mechanism. The formalism, based on previous 
work of Szabo et al. [Phys. Rev. E 67 056102 (2002)] for the clustering spectrum, measuring three 
vertex correlations, is based on a rate equation in the continuous degree and time approximation for 
the average degree of the nearest neighbors of vertices of degree k, with an appropriate boundary 
condition. We study the properties of both two and three vertex correlations for linear preferential 
attachment models, and also for a model yielding a large clustering coefficient. The expressions 
obtained are checked by means of extensive numerical simulations. The rate equation proposed 
can be generalized to more sophisticated growing network models, and also extended to deal with 
related correlation measures. As an example, we consider the case of a recently proposed model of 
weighted networks, for which we are able to compute a weighted two vertex correlation function, 
taking into account the strength of the interactions between connected vertices. 

PACS numbers: 89.75.-k, 87.23. Ge, 05.70.Ln 



I. INTRODUCTION 

Many natural and man-made complex systems can be 
fruitfully represented and studied in terms of networks or 
graphs 1] , in which the vertices stand for the elementary 
units that compose the system, while the edges picture 
the interactions or relations between pairs of units. This 
topological representation has found many applications 
in fields as diverse as the Internet 0, the World-Wide 
Web , biological interacting networks 0, 0, Q or social 
systems , leading to the development of a new branch 
of statistical mechanics, the modern theory of complex 
networks II®. 

The empirical study of real complex networks, pro- 
moted by the recent accessibility to computers powerful 
enough to deal with very large databases, has uncovered 
the presence of some typical characteristics. The three 
most relevant of these are: (i) The small-world property 
[Tof , defined by an average shortest path length — average 
distance between any pair of vertices — increasing very 
slowly (logarithmically or slower |ll| ) with the network 
size N. (ii) The presence of a large transitivity 0, which 
implies that two neighbors of a given vertex are also con- 
nected to each other with large probability. Transitivity 
can be quantitatively measured by means of the clus- 
tering coefficient Cj of vertex i lOj, defined as the ra- 
tio between the number of edges TOj existing betwen the 
ki neighbors of i, and its maximum possible value, i.e. 
Cj = 2mi/(ki(ki — 1)). The average clustering coefficient, 
defined as C = J2i C i/N , usually takes quite large values 
in real complex networks, (iii) A scale- free behaviour for 
the degree distribution P(k) (HQ, defined as the proba- 
bility that a vertex is connected to k other vertices (has 
degree k), that shows a power-law behavior 



P{k) 



(1) 



where 7 is a characteristic degree exponent, usually in 
the range 2 < 7 < 3. A major role is especially played 
by the scale-free nature of many real complex networks, 
which implies a large connectivity heterogeneity, at the 
basis of the peculiar behavior shown by dynamical pro- 
cesses taking place on top of these networks, such as the 
resilience to random da mage flp. IT3L Il4| . the spreading of 
infectious agents |15l Ho . 1171 Il8j . or diffusion- annihilation 
processes fl3.l20l|. 

/From a theoretical point of view, the empirical re- 
search has inspired the proposal of new network mod- 
els, aimed at reproducing and explaining the properties 
exhibited by complex networks. In this respect, many 
efforts have been devoted to develop models capable to 
account for a scale-free degree distribution. Classical net- 
work modeling was previously based on the Erdos-Renyi 
random graph model plll22T |. which is a static model (i.e. 
defined for a fixed number of vertices N) yielding small- 
world networks with a Poisson degree distribution. A 
change of perspective in nework modeling took place after 
the introduction of the preferential attachme nt p aradigm 
first proposed by Barabasi and Albert (BA) [23j . The in- 
sight behind this concept is the realization of two facts: 
Firstly, most complex networks are the result of a growth 
process, in which new vertices are added in time to the 
system. Secondly, new edges are not placed at random, 
but tend to connect to vertices which already have a large 
degree. It turns out that these two ingredients are able to 
reproduce scale-free degr ee distributions with a tunable 
degree exponent [23I l24j]. Moreover, it has been shown 
that not all sorts of preferential attachment are able to 
generate a power law degree distribution, but only those 
in which new edges attach to vertices with a probability 
strictly linear in their degree |25| , and that some alterna- 
tive mechanisms, such as the copying model [26] implic- 
itly define a linear preferential attachment dynamics. 
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While a proper characterization and understanding of 
the origin of the scale-free degree distribution displayed 
by most real complex networks is a fundamental task, 
it has been recently realized that this property does not 
provide a sufhent characterization of natural networks. 
In fact, these systems seem to exhibit also ubiquitous 
degree correlations, which translate in the fact that the 
degrees of the vertices at the end points of any given edge 
are not independent [23, |28|, |2!J |3(| ■ Two vertex degree 
correlation can be conveniently measured by means of the 
conditional probability P(k'\k) that a vertex of degree k 
is connected to a vertex of degree kl . For uncorrelated 
networks, in which this conditional probability is inde- 
pendent of k, it can be estimated as the probability that 
any edge end points to a vertex of degree k 1 , which is 
simply given by P n . c .(k'\k) = k'P{k')/{k) [p. The em- 
pirical evaluation of P(k'\k) in real networks is usually 
a cumbersome task, restricted by finite size data yield- 
ing noisy results. For this reason, it is more practical to 
analyze instead the average degree of the nearest neigh- 
bors of the vertices of degree k, which is formally defined 
as 271 



l {k)=Y J k'P{k'\k). 



(2) 



For uncorrelated networks, in which P(k'\k) does not de- 
pend on k, we have 



(A: 2 



(k) 



(3) 



independent of k. Thus, a k nn (k) function with an ex- 
plicit dependence on the degree signals the presence of 
two vertex degree correlations in the network. When 
k n n{k) is an increa sing function of k, the network shows 
assortative mixing [29j (vertices of large degree connected 
more preferably with vertices of large degree, and vice- 
versa). Negative correlations (low degree vertices con- 
nected preferably with large degree vertices), on the other 
hand, give rise to dissasortative mixing, detected by a de- 
creasing k nn (k) function. 

Analogously to two vertex correlations, correlations 
implying three vertices can be mesured by means of the 
probability P(k',k"\k) that a vertex of degree k is si- 
multaneously connected to vertices of degree k' and k" . 
Again, the difficulties in directly estimating this condi- 
tional probability can be overcome by analyzing the clus- 
tering coefficient. The average clustering coefficient of 
the vertices of degree k (the clustering spectrum), c(k) 
can be formally computed as the probability that 
a vertex of degree k is connected to vertices of degree k' 
and k" , and that those two vertices are at the same time 
joined by an edg e, averaged over all the possible values 
of k' and k" [3l|]. Thus, we can write c(fc) as a function 
of the three vertex correlations as 



c(k)= P(k',k"\k)p k k ,^„ 



(4) 



where p\, k ,, is the probability that vertices k! and k" 
are connected, provided that they have a common neigh- 
bor k j50|. From this expression, the average clustering 
coefficient can be computed as 



C = J2P(k)c(k). 



(5) 



For uncorrelated networks, we have that P n c (k' , k"\k) 
Pn.c.{k'\k)Pn. c .(k"\k) [H, and V \, k „ = (k' - l)(fc" 
l)/{k)N Therefore we obtain ' 



.(*) 



((fc 2 ) ~ 
(k) 3 N 



(6) 



k',k" 



That is, c n . c .(fc) is independent of k and equal to the 
average clustering coefficient C [3^ . A functional depen- 
dence of c(k) on the degree can thus be attributed to the 
presence of a structure in the three vertex correlations. 
In particular, for scale-free networks it has been observed 
that in many instances, the clustering spectrum exhibits 
also a power-law behavior, c(k) ~ k~ a . A value of a close 
to 1 has been empirically observed in several natural net- 
works, and analytically found in some growing network 
models [12, HH ■ These findings have led to propose 
the clustering spectrum c(fc) as a tool to measure hierar- 
chical organization and modularity in complex networks 

M 

The presence of correlations are thus a very relevant 
issue in order to understand and classify complex net- 
works, especially in view of the important consequences 
that they can have on dynamical processes taking place 
on the topology defined by the networks [3(| |33, HU- 
While there are quite a few empirical results for real net- 
works, the situation is not the same for network models, 
and therefore there is no consensus regarding the origin 
of assortative and dissasortative mixing, and its relation 
with the power law behavior of the clustering spectrum 
c(k). In fact, most works devoted to analytical calcu- 
lations of correlations in complex network models have 
been performed only for particular cases [3, 13^, 0, HU . 
In this respect, it is noteworthy the rate equation formal- 
ism proposed by Szabo et al. in Ref. [3j| (see also |39|) 
to compute c(k) in growing network models with prefer- 
ential attachment. However, and up to our knowledge, 
no such formalism has been developed to deal with two 
vertex correlations, as given by the k nn {k) function. 

In this paper we revise the formalism proposed in 
Ref. for computing the clustering spectrum in grow- 
ing networks models with preferential attachment. Re- 
considering the mean field rate equation in the continu- 
ous degree approximation for the c(k) presented in [34| . 
we are able to provide a general expression for the bound- 
ary condition of this rate equation, which was neglected 
in the original treatment and which can have as a mat- 
ter of fact relevant effects in the final solution, as we will 
show below. Inspired by this result we also propose a new 
rate equation for two vertex correlations, as measured 
by the k nn {k) function, and work out the correponding 
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boundary condition. We remark that both equations are 
valid in general for the so-called "citation networks" 0, 
in which neither edge or vertex removal nor edge rewiring 
are allowed. The general formalism is presented in sec- 
tion [D] The rate equations obtained can be easily solved 
for growing networks with linear preferential attachement 
(LPA) 24] , as shown in section IIIII In particular, we are 
able to obtain expressions for the dependence of the cor- 
relations on the degree k and the system size N, for both 
the dissasortative and assortative regimes of the model, 
which are in very good agreement with numerical simu- 
lations and previous scaling arguments jijj. LPA mod- 
els generate networks with a vanishing average clustering 
coefficient C. In order to asses the effects of a nonzero 
clustering, we study in section llVl a growing model pre- 
senting a large final clustering coefficient [41| . which we 
are able to compute with very good accuracy. The results 
obtained are qualitatively similar to those shown by the 
Holme-Kim model [34l I42 ] . The rate equation proposed 
for two vertex correlations can be easily generalized to 
deal with more involved situations. As an example of its 
flexibility, we examine in section [V] a recently proposed 
model for the evolution of weighted complex networks 
|43j| . In this case, we extend our formalism to compute 
a function estimating weighted two vertex correlations, 
in which the actual strength of the interactions between 
neighboring vertices is taken into account. Our results 
allow us to discuss the scaling form of two and three ver- 
tex correlation functions, and signal the possible relations 
that can be established between them. 



II. RATE EQUATIONS FOR CORRELATIONS 
IN GROWING NETWORKS 

Let us consider the class of growing network models in 
which, at each time step, a new vertex with m edges is 
added to the network. For the vertex introduced at time 
t, each of its emanating edges is connected to an existing 
vertex introduced at time s (s < t) with a connection 
probability H s ({k},t), which is assumed to depend only 
on the degrees of the existing vertices at time t, {k} = 
{ki(t), . . . kt-x(t)}. Time runs from 1 to JV (the final 
network size) , and since for each new vertex m edges are 
added, the average degree is fixed and given by (k) = 2m. 
In the continuous k and t approximation ^(J i the average 
degree that the vertex s (i.e. the vertex introduced at 
time s) has at time t (t > s) can be computed from the 
rate equation 



dk s (t) 
dt 



mii s ({k},t), 



(7) 



with the boundary condition k s (s) — m (initially all ver- 
tices have m connections). From k s (t), the degree distri- 
bution can be obtained as 



P(M) 



dk s (t) 
ds 



(8) 



where s(k,t) is the solution of the implicit equation k = 
k s (t). 

For this class of networks it is possible to obtain a rate 
equation for the clustering spectrum. Following Ref. |34j |. 
we recall that the clustering coefficient c s (t) of vertex s 
at time t is defined as the ratio between the number of 
edges between the neighbors of s and its maximum pos- 
sible value. Then, if M s (t) is the number of connections 
between the neighbors of s at time t, we have that 



,(*) 



2M s (t) 



k s (t)[k s (t) - iy 



(9) 



During the growth of the network, M s (t) can only in- 
crease by the simultaneous addition of an edge to s and 
one of its neighbors. Therefore, in the continuous k ap- 
proximation, we can write down the following rate equa- 
tion pi ]: 



dM s {t) 
dt 



m(m-l)n s ({k},t) n j({ fc }^) 



(10) 



where V(s) is the set of neighbours of vertex s. In or- 
der to solve this equation we must provide additionally 
a boundary condition. To do so, we observe that M s (s) 
is the number of triangles created by the introduction of 
vertex s. Therefore 



M s (s) 



m(m — 1) 



]T n J ({fc}, s )n„({/c}, s )n J ,„, 



(ii) 

that is, it is proportional to the probability that s is 
connected to vertices j and n, times the probability Hj. n 
that j and n are linked, averaged over all vertices j and 
n existing in the network at time s. The probability IL;, n 
is given by 



n 



j,n - S(j - n)mn„({fe}, j) + 6(n - j)mU 3 ({k}, n), 



where Q(x) is the Heaviside step function. Solving the 
equation for M s (t) with the boundary condition Eq. I|ll|l , 
we can obtain the clustering c s (t) from Eq. JSJ- Then, 
since in growing network models in the continuous k ap- 
proximation the degree at time t is uniquely determined 
by the introduction time s, from c s (t) we can directly ob- 
tain the clustering spectrum c(fc) as a function of k and 
the largest time t = N. 

In the case of the two vertex correlation function 
knn(k), we can proceed along similar lines. Let us de- 
fine R s (t) as the sum of the degrees of the neighbors of 
vertex s, evaluated at time t. That is, 



R s (t)= Y, 



(13) 



jev(s) 



s—s(k,t) 



The average degree of the neighbors of vertex s, k nn (s) is 
then given by k nn {s) = R s (t)/k s (t). During the growth 
of the network, R s (t) can only increase by the adjunction 
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of a new vertex connected either directly to s, or to a 
neighbour of s. In the first case R s {t) increases by an 
amount to (the degree of the newly linked vertex), while 
in the second case it increases by one unit. Therefore, in 
the continuous k approximation, we can write down the 
following rate equation: 



dR s {t) 
dt 



iU s ({k},t)}+m £ n,({fc},i). (14) 

jev(s) 



In order to obtain the boundary condition for this equa- 
tion, we observe that, at time s, the new vertex s con- 
nects to an old vertex of degree kj(s) with probability 
IIj({fc},s), and that this vertex gains a new connection 
in the process. Therefore, 



R s (s)=mJ2^j({k},s)[k j (s) + l}. 



(15) 



^From the solution of this rate equation, we can obtain 
knn(s) and from it the two vertex correlation function by 
the functional dependence of s on k and t — N. 

We must note that Eqs. ifTUJl. fTTft . JT2), O, and JTSJ 
are only valid for the so-called "citation networks" , in 
which neither edge removal nor rewiring [44| are allowed, 
since these two processes can induce nonlocal variations 
in the conectivity of the nearest neighbors. 



III. LINEAR PREFERENTIAL ATTACHEMENT 
MODELS 

As an example of the application of the rate equations 
presented in the previous Section we consider the gen- 
eral LPA model proposed in Ref. [2J] , for which the rate 
equations for R s (t) and M s (t) can be closed and solved 
analytically. For general LPA, the connection probability 
takes the form 



U s ({k},t) = 



b 1 k s (t)+b 2 

E 3 -[Mi(t) + &2] ! 



(16) 



where bi are real constants. Since, for each new vertex, 
m edges are added to the network, the normalization 
constant in Eq. (|16fl takes the form ^2j[bikj(t) + b 2 ] = 
(2m&i+&2)£- Thus, the model depends only on the tuning 
parameter a = bz/bx, taking values in the interval a € 
] — to, 00 [ (since the minimum degree is to, a cannot be 
lower than — m in order for II S ({k}, t) to remain positive). 

Solving the rate equation for the degrees Eq. Q, we 
obtain 



, t \ P TO 
k s (t) = (m + a) - -a, (3 = 

s I 2m + a 



(17) 



Therefore, this model yields networks with a power law 
degree distribution of the form 



For a > 0, we obtain a degree exponent 7 > 3, which 
corresponds to finite degree fluctuations in the thermo- 
dynamic limit. The case a = recovers the original BA 
model with 7 = 3 |23| . Finaly, values — to < a < yield 
scale free networks with a tunable degree exponent, in 
the range 7 s]2, 3[. 



A. Two vertex degree correlations 



The rate equation for R s (t) takes in this case the form 



dR 3 (t) 
dt 



2 k s (t) + a t-^ kj (t) + a 

— jyi -|- \ ffi — 

(2m + a)t (2m + a)t 

p {m + a)k s (t) + am | ^ R s (t) 



where we have used the definition of R s (t) , Eq. (|13|l . The 
general solution of the previous equation is 

R s (t) =A a (s)t p + P(m + a) 2 \nt + a 2 (20) 

where Aq(s) is given by the boundary condition R s (s). 
From Eq. I|15|) . we have that 

' a+(a + l)fc J -(s) + fc 2 (s) 
R s (s) = TO^ — 



(2to + a)s 



(3a + 2m(3{a+l) + -^k 2 3 {s) . (21) 



3=1 



P(k) ~ fc -7 , 7 = 3 + a/TO. 



(18) 



Plugging in kj(s) — (to + a)(s/j)' 9 — a into R s (s) results 



R s (s) =m(l-a)+(3(m + a) 2 s 2(3 - 1 J2r 2P - (22) 

i=i 

In order to estimate the behaviour of the previous ex- 
pression, we have to distinguish the different cases corre- 
sponding to the possible values of (3 (namely a). 

(i) —to < a < (i.e. > 1/2, 7 < 3). In this case, for 
large s, Ylj=i J _2/3 — C(2/3), where ((x) is the Riemann 
Zeta function, and thus, at leading order, 

R s (s) ~ /3C(2/3)(m + afs 2 ^ 1 . (23) 

The determination of the integration constant Aq(s) from 
Eq. ||5n|l yields then 

R s (t) ~ pC(2f3)(m+a) 2 t^s^- 1 +f3(m+a) 2 hx^ 

(24) 

where terms independent of t and s and terms going to 
zero in the large t or s limit have been neglected. From 
the definition of k nn (s), and substituting s as a function 
of k and t = N (the network final size) in the limit of 
large k and N we obtain the following expression for the 
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average degree of the neighbors of the vertices of degree 
k: 



k nn (k,N) ~ / 3C(2/3)(™ + a) 3 - 1 / /3 iV^- 1 fc- 2 + 1 ^ 
+ (to + a) In 



to + a 



(25) 



^From this expression, we conclude that the LPA with 
a < yields in the large N limit networks with dis- 
sasortative two vertex correlations, characterized by a 
power-law decay k nn (k,N) ~ A 2/5 - 1 fc- 2+1 / /3 . This ex- 
ponent was previously obtained by scaling arguments in 
Ref. jijj. The dependence of the prefactor on N im- 
plies that k nn (k, N) diverges in the thermodynamic limit 
N — > 00, in agreement with the theoretical arguments 
provided in Ref. |45j. For finite N, however, the logarith- 
mic term with constant prefactor can induce corrections 
to the power-law scaling. Since 2/3 — 1 is at most 1, the 
growth of k nn {k,N) is not very steep with TV and these 
corrections are observable in numerical simulations, as 
we will see below in this Section. 

(ii) a = (i.e. \3 = 1/2, 7 = 3). For this value 
of j3 Eq. IL'l'l) is dominated by a logarithmic divergence, 
J2j=i r 1 - lis, yielding 



m 

R s (s) ~ — Ins. 



; From here, we obtain 



R s {t) 



711 

~~2 



lnt, 



(26) 



(27) 



and finally 



a very weak dependence on k in the k nn (k,N) function, 
compatible nevertheless with the behavior given by our 
rate equation approach in the large k limit. This k depen- 
dence, evidentiated at small values of the degree, cannot 
be detected within our approach, since it has been for- 
mulated in the continuous k approximation. 

(iii) a > (i.e. [3 < 1/2, 7 > 3). In this situation, the 
summation in Eq. (J22J), ELii~ 2/3 - s 1_2,3 /(l - 2/3), is 
divergent, and therefore R s (s) becomes independent of 
s. This leads to 



R s (t) ~ (3{m + af 



13 



hi 



/3(to + af 



and finally the dominant behaviour for the correlation 
function is 



/3 



k nn (k, N) ~ (m + a) In 



to + a 



(30) 



In this case, k nn (k, N) is independent of the network size, 
and increases logarithmically with k: For 7 > 3, LPA 
yields networks with weak assortative mixing. 



B. Three vertex correlations 

In order to estimate three vertex degree correlations by 
means of the clustering spectrum c(k), we start from the 
rate equation Eq. (JTDJ, which for the LPA model takes 
the form 



k nn (k,N)~ - In AT. 



(28) 



That is, two vertex correlations are independent of the 
degree and grow with the system size as In AT, in agree- 
ment with the behavior expected for an uncorrelated 
scale- free network with degree exponent 7 = 3, Eq. J3Jl. 
Numerical simulation of the B A model [2^ show actually 



dM s (t) . ,,k s (t)+a kj(t)+a 

— =m{m — IH r- > -r- r- 

dt J (2m + a)t (2m + a)t 

k (t) + a 

= m(m-l) [R s (t)+ak s (t)]. (31) 

(2m + a) z r z 

The boundary condition M s (s) can be written as 



M s (s) 



m(m — 1) 



^n,({fc}, s )n„({fc}, s )n,-„ 



3,n 

/3 2 (to- l)(TO + q) 3 2/3 _ 2 
2(2to + o) S 

/3 2 (m - 1)(to + af 
2(2to + o) 



e r x +Er v E 



n— 1 j'=n,+l 



j=l n=j+l 



, 2 "- 2 x2x^n- 2 ' 5 E ^ 

n— 1 j—n-\-l 



(32) 



In order to solve Eq. I|31|) . we approximate fc s (i) and by their dominant terms for large t and s, as computed 

above for the different possible values of a. 



6 



(i) — to < a < 0. In this case we have 

k s {t)~{m + a) R s (t) ~ /3C(2/?)(to + a)V s^" 1 . 

Introducing this expression into Eq. H31[). we obtain at leading order 



(33) 



M s ii) * ^ m ( -^t a) ff\ ^ ~ + *.(«)• (34) 

(2p — 1)(2to + a) 

In order to compute M s (s), we observe that the double summation in Eq. I|32[l takes the form at large s 

s s s 

S = J2 n ~ 2fJ ^ - ^^{Xns-lnn) ~ C(2/3)lns, (35) 

n— 1 j—n-\-l n—1 



since X)^=i 71 2 l nn is convergent for /3 > 1/2. Thus we obtain 

( m -l)( m + a)3 s 2/3-n s -i , g2 (m - l)(m + a) 3 c(23) *0-2 ln 

M *V>-P (2/3-l)(2TO + a r S jS + ^ 2to + o WjS ' 

and from here the expression for the three vertex correlation function follows: 



(36) 



c(k N) ~ ^^(TO-^^ + a) 3 - 1 ^ 2g _ 2 2+m 
C(fc ' 7Vj " (2/3-l)(2ro + a) A * 

2/3 2 C(2/3)(to- l)(m + a) 5 - 2 /^ 9fl , ,, 9fl , . 

+ M ^ V MA - — — - \nNN 2l} ~ 2 k' A+w . (37) 

2to + a 

To understand the asymptotic behavior of c(k, N), two limits have to be taken, corresponding to large TV and large 

k: 

• At fixed and large N, the leading behavior at large k is c(k, N) ~ N 2l3 ~ 2 k~ 2+1 < 1/3 . 

• At fixed k < (\nN) f3 / {213 -^ and large N, the leading behavior is instead c(k,N) ~ N 213 - 2 lniVfc" 4+2//3 . 

Therefore, in the numerical simulations we should expect to observe a crossover between these two scaling regimes, 
(ii) a = 0. We now have 



which yields 



16s 

Since = 1/2, M s (s), as given by Eq. i|32|) . can be easily shown to be 

to 2 (to — 1) 2 



k s (t)-mJ~, R s {t)~—J-lnt, (38) 



M s (t) ~ — ^ (In 2 t - In 2 s) + M s (a) . (39) 



M a («) = —*- ^s, (40) 

16s 



and we obtain 



TO 2 (TO — 1) 

which results in a clustering coefficient at large N 



M.(*)^ '" V ;"" X; ln 2 f, (41) 
16s 



/, ,r\ to — 1 In 2 iV , 
c(fc,7V)~_ _ (42) 

We recover the well-known result for the BA model that the clustering spectrum is constant, and scaling as In 2 N/N, 
as observed in Ref. j34[. It is worth noting that the computation of the boundary condition l|4l)[) is essential in recovering 
this result. 
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(iii) a > 0. For this range of values of a we have 



yielding 



/,-„(/) - (m -:-,,) ( ~\ , R s {t) ~ (3{m + a) 2 (~\ In (- ) . (43) 



j2 (m-l)(m + a) 3 _ 2/3 J 2/3 _ x , - 



M ^ * ^2m + a) (1 -2V l"** * UJ + 1-2/? > + ^ (44) 



For the evaluation of M s (s), we observe that the double summation S defined in Eq. (|35|) shows now a power-law 
divergence, S ~ s 1_2,3 /(l — 2/3) 2 . Thus we have 

(m-lXm + q) 3 _ w f 2 „_, /t\ 2 s M-l_;i8-l 



yielding a three vertex correlation function 



(2m + a)(l — 2p)^ 

Therefore, for a > (i.e. /3 < 1/2), we obtain that the average clustering of the vertices of degree k is a growing 
function of fc, scaling as c(k,N) ~ A~ 1 fc~ 2+1 / /3 . Since by definition the clustering must be smaller than 1, this 

growing behavior must be restricted to degree values k < N^/^fK 

I 



C. Computer simulations 

In order to check the analytical results obtained in this 
Section, we have performed extensive numerical simula- 
tions of the LPA model. Simulations were performed for 
system sizes ranging from N = 10 3 to N = 10 6 , av- 
eraging over 100 network samples for each value of N 
and a. We focus in particular in the ranges a < and 
a > 0, which have not been previously explored (for nu- 
merical data corresponding to a — 0, the BA model, see 

Refs. HEH). 

In Figs, ^and [21 we explore the behavior of networks 
generated for a < 0. We consider first the average de- 
gree of the nearest neighbors k nn (k,N). Fig.QJa) corre- 
sponds to m = 4, a = —2, values that yield j3 = 2/3 
and 7 = 2.5, while Fig. D^b) plots data for m = 4, 
a = —3, corresponding to (3 — 4/5 and 7 = 2.25. The 
dashed lines represent the power law behavior fc^ 2 + 1 // 3 
expected analytically. We observe that, as the size of 
the network increases, the data follow the predicted scal- 
ing k nn (k,N) ~ jy2/3-l£-2+i//J on larger and larger 
ranges. Nevertheless, the logarithmic corrections present 
in Eq. I|25fl are clearly visible from the large k deviations 
shown by the data (middle plots in Fig.QJ. The logarith- 
mic correction can, in fact, be taken into account if one 
rescales k nn {k,N) appropriately. Namely, if we define 

k™ c (k, N) = k nn (k, N) - (m + a) In ( — *—) , (47) 

\m + a J 

then, from see Eq. Q25f) . we expect 

V°* c (k, N)^- 2 ? ~ k~ 2+1 ^. (48) 



In the bottom plots of Fig. we draw the rescaled aver- 
age degree of the nearest neighbors with logarithmic cor- 
rections. The collapse of the data is indeed surprisingly 
good, given the numerous approximations and leading 
order cancellations made in our calculations. The re- 
maining discrepancy at very large k is presumably due 
to the subdominant terms we have neglected. 

In Fig. |21 we represent the clustering spectrum c(k,N) 
for the same parameters as before, i.e. m = 4, a = —2 
(a) and m = 4, a = —3 (b). The top plots repre- 
sent the corresponding nonrescaled raw data. Accord- 
ing to the solution provided in Eq. (|37|l . for small values 
of k it is expected an asymptotic scaling of the form 
c(k,N) ~ N 2 ' 3 ~ 2 \nNk~ 4+2 / fi . This behavior is well 
recovered in the bottom plots in Fig. |2] for both val- 
ues of a, where we can see that the first points in the 
graphics for different values of N collapse onto the same 
curve, with the predicted k dependence. For large val- 
ues of k, on the other hand, we expect instead a scaling 
c(k, N) ~ N 2 @~ 2 k~ 2+1/ 1/3 , which is again recovered in the 
middle plots of this Figure, showing a better collapse in 
the intermediate range of k values. At very large k val- 
ues, finally, the neglected logarithmic terms come into 
play, affecting the scaling of the data. It is important to 
notice the important role played by the boundary condi- 
tion Eq. 1|11|) , which is responsible for the second term in 
Eq. I|37|) . giving the correct scaling behavior for small k. 

In Fig. |21 we finally explore the average degree of the 
nearest neighbors (a) and the clustering spectrum (b) 
for the LPA model with a > 0. We focus in particu- 
lar on the values m = 4 and a = 2 (top plots), yielding 
(3 = 2/5, 7 = 3.5; a = 5 (middle plots), with /? = 4/13, 
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FIG. 1: Average degree for the nearest neighbors of the vertices of degree k, k nn (k, N), for the LPA model for m = 4, with 
a = -2 (a) and a = -3 (b). Symbols correspond to the different system sizes N = 10 4 (o), N = 3 x fO 4 (□) N = fO 5 (o) 
AT = 10 6 (A). Top plots: Raw data. Middle plots: Data rescaled by the size prefactor jV 1_2/3 . Bottom plots: Data rescaled by 
the size prefactor with logarithmic corrections. The dashed lines represent a power-law decay with exponent —2 + 1//3. 




FIG. 2: Clustering spectrum c(k,N) for the LPA model for m = 4, with a — —2 (a) and a — —3 (b). Symbols correspond to 
the different system sizes N = 10 4 (o), N = 3 x 10 4 (□) N = 10 5 (o) N = 10 6 (A). Top plots: Raw data. Middle plots: Data 
rescaled by the size prefactor N 2 ~ 2/3 , corresponding to large k. Bottom plots: Data rescaled by the size 

corresponding to small k. The full and dashed lines represent power-law decays with exponent —2 + 1//3 and —4 + 2//3, 
respectively. 



7 = 4.25; and a — 10 (bottom plots), that corresponds to 
(3 = 2/9, 7 = 5.5. For the k nn (k, N) function our theoret- 
ical analysis predicts a function independent of the net- 
work size, and slowly (logarithmically) growing with the 
degree. These predictions are confirmed in Fig. Efa). It is 
noteworthy that the theoretical prediction becomes more 
accurate for large a: While the collapse is quite good for 
a > 5, the graphs are a bit scattered for the smallest 
value of a considered. This fact is due to the slow con- 
vergence (as N grows) to the theoretical asymptotic form 
for small a. Analogously, the clustering spectrum shows 
the predicted scaling c(k,N) - N^k-^ 1 /?, Fig.^b). 
The dependence on system size is correctly captured by 
our analysis for larger values of a. In this range, however, 
the power-law dependence on k seems to depart from the 
theoretical exponent —2 + This apparent departure 



can be due to the limited range of degrees for such large 
values of the degree exponent (the degree range decreases 
for increasing a), as well as to the subdominant terms ne- 
glected in the asymptotic expression Eq. JJBJ. 



IV. GROWING NETWORKS WITH LARGE 
CLUSTERING 

As we have seen in the previous Section, the LPA 
model yields a clustering spectrum c(k) that, even if 
presenting a non-trivial scaling, vanishes in the thermo- 
dynamic limit, i.e., limjv^oo c(fc, N) = 0. However, for 
many complex networks, such as the Internet 0, we ob- 
serve a function c(k) scaling with fc, together with a finite 
clustering. 
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FIG. 3: Average degree for the nearest neighbors of the vertices of degree k, k nn (k,N) (a) and clustering spectrum, c(k,N) 
(b) for the LPA with m = 4 and positive values of a. Symbols correspond to the different system sizes N = 10 4 (o), N = 10 5 
(□) TV = 10 6 (o). Top plots: a — 2. Middle plots: a = 5. Bottom plots: a — 10. Data for c(k,N) has been rescaled by the 
theoretical size prefactor N. The dashed lines represent a power-law behavior with exponent —2 + 1//3. 



Several models have been proposed which reproduce 
this feature. In particular, Dorogovtsev, Mendes and 
Samukhin (DMS) introduced in Ref. [41j a scale-free 
growing network with large clustering coefficient C. The 
model is defined as follows: At each time-step, a vertex is 
added and connected to the two extremities of a randomly 
chosen edge, thus forming a triangle. The resulting net- 
work has a power-law degree distribution P(k) ~ k~ 3 , 
with (k) = 4, and since each new vertex induces the cre- 
ation of at least one triangle, we expect this model to gen- 
erate networks with finite clustering coefficient. We con- 
sider here a generalization of the DMS model, in which 
every new node is connected to the extremities of m/2 
randomly chosen edges, where m is an even number. The 
original model corresponds thus to m = 2, and this gen- 
eralization allows to tune the average degree, setting it 
to (fc) = 2m. 

It is important to notice that this model actually con- 
tains the LPA mechanism in a disguised form. Indeed, 
the probability to choose a vertex s is clearly proportional 
to the number of edges arriving to s, i.e. to its degree 
k s . At time t there are mt edges so that k s = 2mt 
and the probability to choose s when choosing one edge 
is k s /(mt) {^2 s k s /{mt) = 2 since one chooses indeed 2 
vertices). This process is repeated m/2 times and thus, 
at each time step the probability to choose s is k s /(2t). 

This shows that another way of formulating the ran- 
dom choice of an edge is in fact the following: a vertex 
s is chosen with the usual preferential attachment prob- 
ability k s / (2mi), and then one of its neighbors is chosen 
at random, i.e. with probability l/k s . 

It is then clear that the rate equation for the degree is 
given by 

dksit) _ k.(t) 

- (49) 

leading to k s (t) = ra^/s) 1 / 2 and to a scale free degree 



distribution of the form P(k) ~ 2m 2 /c~ 3 . 

We are now in position to write down the rate equa- 
tions for the network correlations, taking into account 
that, each time a vertex is chosen, so is one of its neigh- 
bours. 



A. Two vertex degree correlations 

At each time step, R s (t) can increase either if the ver- 
tex s is chosen (and then R s increases by m + 1 because 
a neighbour of s is also chosen), or if a neighbour j is 
chosen together with a neighbour I of j, with I ^ s (and 
then R s increases by 1). Therefore, we have that 




This is exactly the same equation than for the LPA with 
a = 0, i.e. the BA model. Moreover, the boundary 
condition for R s (s) can be written as 

i=i I iev(i) J ) 

(51) 

where, for each one of the m/2 edges chosen by s, the 
first term corresponds to the contribution of j (chosen 
with probability kj/(2ms), and the second term to the 
contribution of a neighbour I of j, which is chosen with 
probability l/kj . This expression is easily reduced to 

R s (s) = \ £ ^ (S)[ ^ (S) + 1] * ^ hi a. (52) 
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Once again we obtain the same result than for the LPA 
with a = 0. The conclusion is that the k nn (k,N) func- 
tion for the generalized DMS model is given by Eq. (|28(l : 
the two vertex correlations are independent of the degree 
and growing with the network size as In TV, in the same 
fashion as the BA model. 



B. Three vertex correlations 

In order to write down the rate equation for M s (t) , we 
have to take into account that, at each time step, m/2 
triangles are formed by the choice of m/2 edges, and that, 
moreover, additional triangles may be formed for m > 2 
by choosing two different edges with a common vertex. 
At each time step, the increase in M s (t) is thus given by 
two terms: 

• The first one comes from choosing the vertex s with 
probability k s {t)/{2t). In this case, M s increases by 
1, since one of the neighbours of s is also chosen. 

• The second contribution comes from the following 
situation: one of the edges chosen is s — I, and 
another one is j — I', with j G V(s), j =/= I and 

The resulting rate equation reads: 

dM s (t) k s (t) m /m \ k s (t) 
It ~ 2t + ~2 V ~2 ~ ) mt 
„ \- kj(t) (. 1 



mt 

j'evO) 



1 



k s (t) 



k s (t) m — 2 



+ 



[k s (t) - l]R s {t). (53) 



2t ' Amt 2 

We use k s ~ my/t/s and R s ~ m 2 \atyjtjs /2 to obtain 
dM s (t) m m 2 (m - 2) In t 



dt 2y/E 
whose solution reads 

't . \ m 2 (m — 2) 



8si 



(54) 



M s (t) ~ m 




16s 



(ln 2 t-ln 2 s)+M s (s). 

(55) 

The boundary condition is again given by two contri- 
butions: First, m/2 triangles are created by attaching s 
to m/2 edges. The second contribution is given by 



kj{s) 



3=1 iev(j) 



1 



kj(a) 



ms 



(56) 



i.e. the sum over all vertices j of the probability that, 
among the m/2 edges chosen by the new node s, one has 



j for extremity, and another one has a neighbour I of j 
for extremity. This yields 



M s (s) 



m m 
1 + ~" 



4ms 2 

3=1 

m m 2 (m — 2) In 2 s 
1 

2 8s 



(57) 



and finally 



M s {t)~m x l~-- 



t m m 2 (m — 2) 
16s 



(In 2 t + In 2 s) . (58) 



The clustering spectrum can therefore be written as 
2fc-m - ° / s™ 2 



c(k, N) 



k(k - 1) 



m - 2 A 2 at ,2 ' m N 
In TV + In 2 

8A^ 



(59) 

The clustering spectrum is now finite in the infinite 
size limit, 

) ft* TYl 

m =^J { k,N)^ w - Y) . (60) 

It is interesting to see that, for the original model with 
m = 2, the finite-size corrections actually vanish and we 
obtain the result c(fc, N) — 2/k, independent of N. This 
scaling is also similar to that obtained for the Holme-Kim 
model in (34|. The knowledge of the exact form of the 
degree distribution for m = 2, P(k) = 12/(fc(fc+l)(fc+2)) 
|4lj , allows us to obtain the average clustering coefficient 
C(m = 2) = 2?r 2 - 19 w 0.739. More generally, for 
large m, approximating P(k) by 2m 2 /k 3 , and sums by 
integrals, yields 



C(m) 



2m 



P(k)c(k) dk 
6m — 4/3 + 2m 2 (2-m) In 



(61) 



C. Computer simulations 

We have performed extensive numerical simulations of 
the generalized DMS model studied in this Section. We 
focus on the clustering spectrum c(k,N) since the re- 
sults for k nn (k) are expected to be equal to the case of 
the BA model. Fig. Ola) shows the excellent agreement 
between the predicted behaviour, Eq. I|59(l . and the nu- 
merical data for various values of m and sizes ranging 
from N = 10 4 to N = 10 6 . As expected, no finite size 
corrections are present for m = 2, while they are cor- 
rectly described by the analytical approach for larger m. 
Moreover, the prediction for the average clustering coef- 
ficient C(m), Eq. H61fl . is also shown to be in excellent 
agreement with numerical data, Fig. Ilfb). 



11 




FIG. 4: (a) Clustering spectrum c(k, N) for the generalized 
DMS model. The top plot corresponds to a systems size N = 
10 4 ; the bottom plot is for N = 10 6 . Symbols correspond to 
different values of the average degree: m = 2 (o), m = 4 (□), 
m = 8 (o). The dashed lines are the theoretical predictions 
given by Eq. 1591 . (b) Average clustering coefficient C(m) as 
a function of m, for networks of size N = 10 J . The full line 
represents the theoretical prediction Eq. IbH . The dashed 
line marks the theoretical value for m = 2, (7(2) = 2ir 2 — 19. 



V. WEIGHTED GROWING NETWORKS 

In the previous Sections we have applied the rate equa- 
tion formalism to analyze two and three vertex correla- 
tions in standard models with either vanishing or con- 
stant clustering coefficient. The formalism for the two 
vertex correlations, however, is not limited to these par- 
ticular cases, and can be easily extended to analyze more 
complex growing network models. As an example, in 
this Section we will consider a recently proposed grow ing 
weighted network model ^3 • Weighted networks |47j are 
a natural generalization of graphs in which a real quantity 
is assigned to each edge, representing the importance or 
weight Wjj of the interaction between the vertices i and j. 
Recently [48j ] , it has been pointed out that real weighted 
networks present a complex architecture, characterized 
by broad distributions of weights, as well as nontrivial 
correlations between the values of the weights and the 
topological structure of the network. 

Motivated by these findings, Ref. [43j proposed a dy- 
namic growing weighted network model, in which new 
edges are attached to old vertices with a connection prob- 



ability depending on the strength, or total weight, of 
the vertex. In order to define the model, let us con- 
sider a weighted network characterized by the elements 
Wij defining the weight assigned to the edge connecting 
vertices i and j . We assume the elements Wij to be sym- 



metric, that is, Wi 



Each vertex i is characterized 



by both its degree fc, and its strength <x,-, defined as 

Oi = ^ Wij. 



(62) 



Sij, we obvi- 



For non- weighted networks, in which Wq 
ously recover Oj = fcj. The model proposed in Ref. |43j 
considers a growing network in which at each time step, a 
new vertex is added to the system and connected with m 
edges to older vertices. The probability that the new ver- 
tex t is connected to s (s < i) is given by the connection 
probability 



n s ({s},i) 



<Js{t) 



(63) 



that is, linearly proportional to the strength of the old 
vertex s. Each new edge carries an initial weight wq = 1. 
Additionally, there is a dynamic rearrangement of the 
weights belonging to the edges of the receiving vertex: 
When the vertex s receives a new connection, the weight 
of its edges is increased by an amount 



J sj 



5^, jeV(s). 



(64) 



This rule implies that, for each new vertex added, 
the total strength of the network is increased by an 
amount 2m + 2m8, therefore the normalization constant 
in Eq. is J^j °iW = 2m(l + S)t. It can be shown, 
within the continuous k approximation |43j |. that this 
model generates scale-free networks, characterized by the 
quantities 



a s (t) = m I - 
s 







with exponents 



= 



k s (t) 



26+1 
26 + 2' 



a s (t) + 2m6 
26+1 



7 = 



45 + 3 
26 + 1' 



, P(fc)~fc-T, 

(65) 
(66) 



Therefore, for 6 > 0, this model yields power-law degree 
distributions with degree exponent 7 g]2, 3[ and f3 > 1/2. 
The case <5 = recovers the BA model. 

It is easy to see that, at the level of the mean field 
rate equations in the continuous k approximation, the 
weighted growing network model described above can be 
mapped into a growing network with LPA and negative 
parameter a given by 



2m6 
26 + 1' 



(67) 
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Therefore, we expect to observe the two and three vertex correlation functions 



r 



A. Weighted two vertex correlations 

The definition of the k nn {k) function we have com- 
puted above completely neglects the effect of the weights. 
Therefore, it provides a biased view of real correlations 
in the system (for example, two neighbors with the same 
degree but widely different weights give the same contri- 
bution). In order to take into account the effect of the 
weights associated to the edges, it has been proposed a 
new correlation measure, the weig hted average degree of 
the nearest neighbors, fc™„(fc) |48|. defined as follows: 



1 



cr s (t) ^ 



(70) 



This definition implies that fc™„(s) > k nn {s) if the edges 
with largest weight point to the neighbors with largest de- 
gree, while k™ n (s) < fc, m (s) in the opposite case. There- 
fore, k™ n (s) measures the effective affinity to connect 
with large or small degee neighbors, according to the 
magnitude of the interaction weight. The weighted aver- 
age degree of the nearest neighbors fc™„(fc), is defined as 
the average of k™ n (s) for all the vertices with the same 
degree k. 

We can study analytically the weighted two vertex cor- 
relations by seeking a rate equation for the quantity 



Q a (t) 



w sj (t)kj(t) 



(71) 



According to the rules defining the model, at each time 
step Q s (t) can increase its value by two mechanisms: 

• If a new vertex is directly attached to s, Q s (t) in- 
creases by an amount m + SQ s /a s 

• If a new vertex is attached to a neighbour j of s, 
then Q s (t) increases by w S j +6w S j/<jj + Sw S jkj / 'oj 

Therefore, the rate equation fulfilled by Q s (t) is 



dQs{t) = mIL s ({a}, t)(m+ -^Q.(t) 



dt 



£ -n i (W,t)^, + ^ + ^^](72) 



It's 



kj(t) 



which, in terms of a s (t) and Q s (t), yields 

dQ s {t) _ ( _5_\ Q s {t) m + 6-2mS a s (t) 
' 1 1 r 1 * 2{1 + S) t ' 



dt 



1 + 5 J t 



(73) 

Inserting the value of a s (t) given by Eq. (|tj5|) , the general 
solution of this equation is 

Q s (t)^A ( S )t^ s /^-y § (m + S-2mS) (tY. (74) 

Since all new edges have an initial weight wq = 1, the 
initial condition for Q s (t) coincides with that of R s (t). 
Solving for Aq(s) from Eq. (|23|l . substituting for the cor- 
responding value of a given by Eq. Ij67(l , we finally obtain 
in the large k and TV limit 



mC(2/3) 



2(l + *)(2* + l) 



N 2f3- 



(75) 



That is, in this model the weighted average degree of the 
nearest neighbours is independent of k, signaling the ab- 
sence of two vertex weig hted correlations, as indeed found 
numerically in Ref. .49]. There is, however, a scaling with 
the system size, given by the factor N 2 " -1 , which is the 
same as that found for the nonweighted correlations for 
the same value of 7. 



B. Computer simulations 

We have performed numerical simulations of the 
weighted growing network model described in Ref. |43j . 
for sizes ranging from N = 10 3 to N = 10 5 , focusing on 
the behavior of the average degree of the nearest neigh- 
bors, for both its non- weighted and weighted versions. In 
Fig. [5] we plot the average degree of the nearest neighbors 
k nn (k, N) for m — 2 and 5 = 2 (a) which corresponds to 
a network with [3 — 5/6, 7 = 2.20, and 6 = 5 (b), that 
yields (3 — 11/12, 7 = 2.09. As expected from the ana- 
lytical analysis performed above, the obtained scaling is 
analogous to the LPA model: the numerical data follows 
the predicted form k nn (k, N) - A 2 ' 3 - 1 fc- 2+1 / /3 . The bot- 
tom plots highlight the presence of the logarithmic cor- 
rection of Eq. i|fci8|) , by plotting the rescaled function 



k™ c (k,N)=k nn (k,N) 



26 + 1 



In 



25+ r 



m 



(76) 
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FIG. 5: Average degree for the nearest neighbors of the ver- 
tices of degree k, k nn (k,N), for the weighted growing model 
for m = 4, with 5 — 2 (a) and <5 = 5 (b). Symbols correspond 
to the different system sizes N = 3 x 10 3 (o), N = 10 4 (□) 
N = 3 x 10 4 (o), N — 10 5 (A). Top plot: Raw data. Mid- 
dle plot: Data rescaled by the size prefactor jV 1-2 ' 9 . Bottom 
plot: Data rescaled by the size prefactor with logarithmic cor- 
rections. The dashed lines represent a power-law decay with 
exponent —2 + 1//3. The middle plots display also the val- 
ues of N 1 ~ 2/3 k^ n (k,N) (filled symbols), which collapse onto 
a horizontal line, in agreement with the analytical prediction 
Eq. f75|. 



In this case, it is noticeable that the rescaled A^ c (fc, N) 
function with logarithmic corrections yields a better data 
collapse than that shown by the LPA model. Even 
though both models are identical at the mean field level, 
the existing microscopic differences seem to yield smaller 
subleading corrections for the weighted growing network 
model. 

For this same set of parameters, we have also evalu- 
ated the weighted average degree of the nearest neigh- 
bors, k™ n (k,N), shown in the middle plot in Fig. [5fa) 
(filled symbols). We observe that the k™ n (k,N) is in- 
deed, as expected, independent of k, and scales with the 
system size with the predicted factor iV 2/3_1 . 



VI. CONCLUSIONS 

A complete theoretical characterization of a growing 
network model should imply not only the estimation of 
the corresponding degree distribution, but also an ana- 
lytical study of the functional form of the correlations 
between the degrees of neighborin g ve rtices. Capitaliz- 
ing on the work of Szabo et al. |34l39| , in this paper we 
have provided a formalism to compute two vertex corre- 
lations, expressed by means of the average degree of the 
nearest neighbors of the vertices of degree k, k nn {k), valid 
for growing network models generated by means of the 
preferential attachment mechanism and belonging to the 
class of the so-called "citation networks" . The formalism 
is based on a rate equation in the continuous k approx- 
imation, together with the appropriate boundary condi- 
tion, that can be easily solved in the case in which the 
preferential attachment is linear in the degree. Addition- 
ally, we have presented a more complete description of the 
rate equation determining the clustering spectrum c(fc), 
by discussing the effects of boundary conditions. Apply- 
ing this framework to several growing network models, we 
have obtained asymptotic expressions for the functions 
k nn (k, N) and c(k, N), evidentiating both the degree de- 
pendence and the scaling with the system size, due to 
finite size effects. As a general result, we conclude that 
networks generated by LPA with degree exponent 7 < 3, 
exhibit the scaling behavior 

fc„„(fc,7V)~iV 2 ' 3 - 1 fc- 2+1 ^, (77) 

previously obtained by means of scaling arguments |4C| , 
which is the signature of disassortative (negative) two 
vertex correlations. We have also been able to identify 
the presence of logarithmic corrections in models with 
LPA, which clearly appear in computer simulations of the 
model. For this LPA model, we also observe the presence 
of small assortative correlations for degree exponents 7 > 
3, characterized by a logarithmic growth of the k nn (k, N) 
function, which is otherwise independent of the network 
size. The situation is more complex in what concerns 
the clustering spectrum c(k,N). For 7 > 3, we observe 
the presence of a crossover between two power-law decays 
in the degree, c(k) ~ k~ a , with a = —4 + 2/(3 for k < 
(lniV)^ 2 ^ 1 ), and a = -2+1/73 in the asymptotic limit, 
while for 7 > 3 we obtain an increasing c(fc, N) function, 
limited by an upper degree cutoff. 

^,From this results we can conclude that the value a ~ 1 
observed in the literature [3^ |34( is not a generic feature 
of all scale-free networks [39|. However, we notice that 
LPA yields networks with a vanishing clustering coeffi- 
cient. In order to assess the possible effects of this fac- 
tor, we have considered the DMS model [4JJ, that gen- 
erates networks with a large value of C, as observed in 
real networks. In this case, we obtain a lack of two ver- 
tex correlations, while the clustering spectrum scales as 
c(k) ~ k^ 1 . An analogous result is obtained for the simi- 
lar Holme-Kim model [3 03 • From this result we could 
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be tempted to conclude that a clustering spectrum scal- 
ing with exponent a = 1 is related to the lack of corre- 
lations. However, this explanation would be in conflict 
with the empirical observation of this exponent in real 
networks with clear dissasortative mixing. More work 
is therefore needed in order to elucidate the relations be- 
tween the scaling exponents of k nn (k) and c(k) in general 
complex networks. 

As a final point, we have shown the flexibility of the 
rate equation approach to compute two vertex corre- 
lations by applying it to a recently proposed weighted 
growing network model, in which edges are further char- 
acterized by a distribution of weights that is dynamically 
coupled to the evolving topology of the network. For this 
model, we are able to extend our formalism to deal with 
weighted two vertex correlations, which measure the ef- 
fect of the strength of the interactions between neighbor- 
ing vertices. 

The very good agreement shown between our analyti- 
cal estimates and numerical simulations suggest that the 
method proposed in this paper to compute two vertex 



correlations is in general valid to characterize growing ci- 
tation network models. An obvious improvement would 
be to extend it to deal with models in which vertex and 
edge removal, and edge rewiring, are allowed. This in- 
clusion, however, would probably lead to quite complex 
non-local rate equation, whose solution would be much 
harder to tackle. 
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